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Abstract. In a previous paper [T], we investigated the Blind Source Separation 
(BSS) problem, for the nonlinear mixing model that we introduced in that paper. We 
proposed to solve this problem by using a maximum likelihood (ML) approach. When 
applying the ML approach to BSS problems, one usually determines the analytical 
expressions of the derivatives of the log-likelihood with respect to the parameters of the 
considered mixing model. In the literature, these calculations were mainly considered 
for linear mixtures up to now. They are more complex for nonlinear mixtures, due to 
dependencies between the considered quantities. Moreover, the notations commonly 
employed by the BSS community in such calculations may become misleading when using 
them for nonlinear mixtures, due to the above-mentioned dependencies. In this document, 
we therefore explain this phenomenon, by showing the effect of indirect dependencies on 
the application of the ML approach to the mixing model considered in [Ij. This yields 
the explicit expression of the complete derivative of the log-likelihood associated to that 
mixing model. 
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1 Data model 

In a previous paper [T], we investigated a Blind Quantum Source (or Signal) Separation 
(BQSS) problem. More precisely, we aimed at restoring two quantum states (qubits) 
after they have been coupled, i.e. after they have been "mixed", using the classical Blind 
Source Separation (BSS) terminology. The considered coupling was based on a cylindrical- 
symmetry Heisenberg model. 

We showed that repeated initializations (i.e. preparations) and measurements per- 
formed with these coupled qubits resulted in an "observation vector" which may be de- 
noted X = [xi, rE2, 2:3]^ using standard BSS notations, where stands for transpose. In 
the considered problem, the components of this vector are equal to xi = pi,X2 = P2 and 
^3 = PSi where the quantities pj are defined in [1] . 



Moreover, we proved that the components of the above observation vector may be 
expressed as nonhnear combinations (i.e. nonhnear "mixtures", in BSS terms) of a set 
of "source signals". Using standard BSS notations, the vector composed of these source 
signals reads s = [31,82,83]'^. In the considered problem, the components of this vector 
are equal to si = ri,S2 = T2 and S3 = A/, where the right-hand terms of these equations 
are defined in [Ij. 

The "mixing model" then consists of the equations which define how the components of 
the observation vector are expressed with respect to (i) the components of the source vector 
and (ii) the mixing parameter (s). By modelling the considered quantum configuration, we 
showed in [IJ that this configuration involves a single mixing parameter, denoted v. The 
mixing equations then read (see (18), (24), (25) in pj) 

rjr^ = pi (1) 
{l-rl){l-4) = p2 (2) 

rlil - rl){l - v^) + {1 - rl)4v^ 

— 2rir2\Jl — rf^Jl — rlVl — f^vsin A/ = p^. (3) 

This mixing model may also be expressed in compact form as 

X = g{s) (4) 

where the nonlinear mixing function g has three components gi to g^, with Xi = gi{s), 
Vi G {1 . . . 3}. These components gi are respectively defined by ([1]), ([2]) and ([3]). 

Eq. (j3]) focuses on the signals (i.e. sources and observations). It hides the fact that the 
observations also depend on the parameters of the mixing model, i.e. on v in the model 
considered here. This additional dependency can be made explicit, by rewriting Q as 

x = g{s,v). (5) 

The latter form of the mixing model is better suited to the maximum likelihood approach 
considered below in this document. 



2 Previously reported results for maximum likelihood ap- 
proach 

In our BSS problem, we aim at retrieving a sequence of unknown source vectors s from the 
corresponding sequence of measured observation vectors x and from the mixing parameter 
V, which is also initially unknown. This parameter v should therefore be estimated before 
proceeding to the source restoration step. 

In [T] , we investigated the estimation of v by means of the maximum likelihood (ML) 
approach. While we detailed this procedure in [1], we here only summarize its features 
which are of importance for developing new aspects of this ML approach further in this 
document. 

The function used to estimate v is the (normalized) log-likelihood of the considered 
data. Under some assumptions [I], the log-likelihood here reads (see (34) in [Ij) 

£ = j2Et[lnfsAsm] - Et[ln\Jg{sm] (6) 

i=l 



2 



where Et[.] represents temporal averaging over the sequence of available data, /s, (.) are 
the probability density functions of the source signals and Jg{s) is the Jacobian of the 
mixing function g. For the function g considered in this investigation, we have (see (28) 
inP) 



Jg(s) = 8rjrj{rl-rl)^l-rl^l-r^Vl-v'^vcosAi. (7) 

When applying the ML approach to a parameter estimation problem, the value selected 
for the set of parameters to be estimated is the value which maximizes the log-likelihood 
C. In order to determine the location of this maximum, one usually considers the gradient 
of C ( see e.g. [2]). In our configuration, only a single parameter is to be estimated, namely 
V. Therefore, the gradient of C is here restricted to the derivative of C with respect to v. 
In [1], we denoted this gradient or derivative by using the notation most often employed 
in the BSS community (see e.g. p]), i.e. We keep this notation in this section, in 
order to clearly refer to the equations available in [1] , but in Section [3] we will show that it 
may be misleading and we will therefore introduce another notation in Section [3l In [l], 
we showed that this gradient reads (see (37) in [Ij) 

^ = (8) 



where (see (36) in [Ij) 
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1=1 3 
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Mu)= ViG{1...3} (9) 



are the score functions of the source signals. 

The last stage of this investigation consists in deriving the expressions of all the terms 

of the right-hand part of ([8]). In [1], we showed that the terms Et[ilJi{si)-^] corresponding 

to i = 1 and i = 2 are equal to zero. The term corresponding to i = 3 was derived from 
equations (38) to (40) in [1], which read 

dF ds3 9F _^ 
ds3 dv dv 



with 



dF 
ds3 



2rir2\/ 1 — rfJ 1 — r^Vl — v'^v cos A/ (11) 



and 



^ = 2v{4 - rl) - 2rir2^l -t\^\- ri^=ijsin A/. (12) 
For the sake of clarity, we here provide the resulting expression of which is also 



used below in the current document. The above equations yield 



^^3 

dv 



dF 



ds:\ 



-1 



dF 

dv 

(r^-rf) (1 - 2t!2) sin A/ 



rir2 



1 - rhfl^VT^ cos Aj {l-v^)vcosAi' 



(13) 
(14) 



The last term that should be determined to obtain the complete expression in ([S]), i.e. 
dJ 

in (37) of [Ij, is its term Equation (41) of [T], we provided an explicit expression 
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dJ 

that we denoted — We here aim at warning the reader that, although each of the 
ov 

equations (37) and (41) of [IJ is correct in itself if interpreted correctly, there may be a 

dJ 

misunderstanding when considering these equations together, because the notation — 

ov 

dJ 

does not have the same meaning in both of them. Briefly, — represents a total derivative 

ov 

in (37) of PP, but a partial derivative in (41) of jT|, as detailed below in Section O The 
right-hand expression in Equation (41) of [Ij is therefore only one of the terms which 

dJ 

compose the complete expression of to be used in (37) of [1]. In the following section 

of the current document, we clarify this point and we determine the complete expression 
dJ 

of of (37) of i.e. of ([8]) of the current document. 

3 New results for maximum likelihood approach 

When applying the ML approach to any BSS configuration, the log-likelihood C is consid- 
ered for the fixed set of observed vectors. The only independent variable in this approach 
is the set of mixing parameters to be estimated, which is here restricted to v. The source 
vectors are dependent variables, here linked to the observations and to by ([5]). The 
overall variations of the log-likelihood C with respect to v result from two types of terms 
contained in the expression of C, i.e. (i) the terms involving v itself and (ii) the terms 
involving the source signals si, S2 and S3, which are here considered as functions of v and 
may therefore be denoted as si{v),S2{v) and S'i{v) for the sake of clarity. 

This approach should be kept in mind when interpreting all equations in [1] , which were 
partly gathered in Section [2] of the current document. Especially, the log-likelihood itself, 
which appears in the left-hand term of ([61), may be denoted as C{v ^ si{v) , S2{v) , s^{v)) 
for the sake of clarity. In order to determine the location of the maximum of this log- 
likelihood, one should then consider the total derivative of C{v , si{v) , S2{v) , s^{v)) with 
respect to v. The notations with partial derivatives in ([H]) may therefore be misleading, as 
confirmed below. Therefore, ([8]) should preferably be rewritten as 

f =-i:^.[*(».)^i-E,[^^i (15) 

1=1 y 



with 

ihA ii\ = ■ 

du 



Mu) = -'-^^^ VzE{1...3}. (16) 



The term — in (|15|) then deserves some care because, as shown by d?]), the Jacobian 
dv 

Jg contains the above-defined two types of dependencies with respect to v, i.e. (i) direct 
dependencies due to the factors in ([7]) which explicitly contain v and (ii) indirect depen- 
dencies due to the factors in ([7]) which depend on the source signals, which themselves 

depencJ3 on v in the ML approach. We here have to consider the total derivative — 

dv 



'^When applying the ML approach to a general mixing model, all source signals depend on all mixing 
parameters . For the specific mixing model considered here, we will show below that only one of the three 
source signals actually depends on the mixing parameter. 
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which takes into account both types of dependencies, and which therefore reads 

dJg ^'^g _|_ ^"^g (17) 



dJa 



In this expression, — is the partial derivative of J„ with respect to v, calculated by 
dv 

considering that the source signals are constant. This partial derivative is the quantity 

dJ 

that we provided in Equation (41) of [T], where we also denoted it as -;— ^. However, since 

ov 

dJ 

we independently used the same notation in the last term of (37) of [T] (that we repeat 

as (jH]) of the current document), we may have incorrectly suggested to the reader that this 
last term of (37) of [1] is equal to the above-mentioned partial derivative. Instead, let us 

dJ 

insist again that the partial derivative in (41) of [Tj is first to be added with the other 

terms in the right-hand part of (1171) . in order to obtain the overall total derivative — 

dv 

defined by (fT7|) . What should eventually be used in the last term of dH) or (fT5|) is this 
total derivative. 

So, back to the calculation of all terms of the total derivative — in (1171) . the first term 

dv 

is available from Equation (41) of [T] and reads 

^ = Srlrlirl - rl)^^,,J^,^^ cos A,. (18) 

The other three terms of (I17p involve the derivative^ ^ . Their calculation first require 
us to determine the expressions of the source signals si = ri,S2 = r2 and S3 = A/ with 
respect to the observations and mixing parameter v. The first two source signals, i.e. 
si = ri and S2 = ^2 are obtained by solving ([1]) and ([2]). These equations are independent 
of V. Therefore, si and §2 are also independent of v and yield 

ds\ , dso 

— i = and ^ = 0. (19 

dv dv 

The derivative ^ was already provided above ird (jl4p . That derivative is used in the 

dJ 

term of (|17p related to the third source signal, together with the partial derivative -77-^, 
which is obtained from ([7]) and reads 

dJ, 



—1. = -8rjrj{rj - rj)Jl - rfJl - r^vT^^v sin A/. (20) 



Inserting ([T8]), ([T9]), (120]) and ([mI) in ([TTl) yields 

^ = ^rirUrl - rf^^,^,^-2- _ g.,,,,.? - r^?.^. ,21) 



^Unlike considered above, the derivatives ^ do not yield any risk of ambiguity between partial 
and total derivatives: each considered signal Si is here considered independently from the other source 
signals and only involves a direct dependency with respect to v. 

^Due to the type of notations employed throughout Section [2l we used a partial derivative notation 
in (I14|l . Anyway, the quantity considered in that equation (I14p is the same as addressed here, as 
explained in the previous footnote. 
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This completes the comment that we aimed at providing in this document, concerning 
the total derivative to be used in the last term of (llSp . 

For the sake of clarity, we now conclude by providing the explicit expression of the 
derivative of the log-likelihood which results from the complete expression (121 p . Using 
this expression together with d?]), (fH|) and ([19]) allows us to rewrite (fT5]) as 



dC 

dv 



— = -Et 




(1 - 2^2) sin A/ 
yr^cosA/ {l-v'^)vcos/\i 



(rl — rD sin A/ 



cos 



2 A, 



(22) 
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